Low and Medium Level Vision Using Channel Representations
نویسنده
چکیده
This thesis introduces and explores a new type of representation for low and medium level vision operations called channel representation. The channel representation is a more general way to represent information than e.g. as numerical values, since it allows incorporation of uncertainty, and simultaneous representation of several hypotheses. More importantly it also allows the representation of “no information” when no statement can be given. A channel representation of a scalar value is a vector of channel values, which are generated by passing the original scalar value through a set of kernel functions. The resultant representation is sparse and monopolar. The word sparse signifies that information is not necessarily present in all channels. On the contrary, most channel values will be zero. The word monopolar signifies that all channel values have the same sign, e.g. they are either positive or zero. A zero channel value denotes “no information”, and for non-zero values, the magnitude signifies the relevance. In the thesis, a framework for channel encoding and local decoding of scalar values is presented. Averaging in the channel representation is identified as a regularised sampling of a probability density function. A subsequent decoding is thus a mode estimation technique. The mode estimation property of channel averaging is exploited in the channel smoothing technique for image noise removal. We introduce an improvement to channel smoothing, called alpha synthesis, which deals with the problem of jagged edges present in the original method. Channel smoothing with alpha synthesis is compared to mean-shift filtering, bilateral filtering, median filtering, and normalized averaging with favourable results. A fast and robust blob-feature extraction method for vector fields is developed. The method is also extended to cluster constant slopes instead of constant regions. The method is intended for view-based object recognition and wide baseline matching. It is demonstrated on a wide baseline matching problem. A sparse scale-space representation of lines and edges is implemented and described. The representation keeps line and edge statements separate, and ensures that they are localised by inhibition from coarser scales. The result is however still locally continuous, in contrast to non-max-suppression approaches, which introduce a binary threshold. The channel representation is well suited to learning, which is demonstrated by applying it in an associative network. An analysis of representational properties of associative networks using the channel representation is made. Finally, a reactive system design using the channel representation is proposed. The system is similar in idea to recursive Bayesian techniques using particle filters, but the present formulation allows learning using the associative networks.
منابع مشابه
Sparse Representations for Medium Level Vision
In this thesis a new type of representation for medium level vision operations is explored. We focus on representations that are sparse and monopolar. The word sparse signifies that information in the feature sets used is not necessarily present at all points. On the contrary, most features will be inactive. The word monopolar signifies that all features have the same sign, e.g. are either posi...
متن کاملThe Impact of Import and Export of Medium Technology Industries on Economic Growth of Iran
Technology spillover from trade channel is important. Absorbing foreign technology by the channel of importation and using the latent technology in those products create export development, increase the quality and quantity of manufactured goods, and modify management, technology improvements and production of goods with high added value; this might favorise optimal allocation of resources. Imp...
متن کاملStudy of Sea Level Variations during Passage of Meteorological Cyclones over the Qeshm Channel
The main purpose of this study is the survey of the atmospheric low pressure effects at sea level oscillations in the Qeshm Channel. For this purpose, the curves of the pressure on the Persian Gulf and Oman Sea, during the passing of the atmospheric low pressure system, are extracted across the region. During the time of occurrence of these events, sea level oscillations in the coastal areas of...
متن کاملLayered Representations for Vision and Video
Human vision, machine vision, and image coding, each demand representations that are useful and ee-cient. The best-established techniques today are based on low-level processing. Future systems for image analysis and image coding will increasingly use image representations that involve such concepts as surfaces , lighting, transparency, etc. These representations fall in the domain of \mid-leve...
متن کاملLearning Hierarchical Bag of Words Using Naive Bayes Clustering
Image analysis tasks such as classification, clustering, detection, and retrieval are only as good as the feature representation of the images they use. Much research in computer vision is focused on finding better or semantically richer image representations. Bag of visual Words (BoW) is a representation that has emerged as an effective one for a variety of computer vision tasks. BoW methods t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004